Scalar, 2D{1} datasets

The 2D{1} datasets are two dimensional, \(d=2\), with one single-component dependent variable, \(p=1\). Here, we present examples of 2D{1} datasets with the scalar dependent variable.

Astronomy dataset

The following dataset is a new observation of the Bubble Nebula acquired by The Hubble Heritage Team, in February 2016. The original dataset was obtained in the FITS format and subsequently converted to the CSD model file-format. For the convenience of illustration, we have downsampled the original dataset.

Let’s load the .csdfe file and look at its data structure.

>>> import csdmpy as cp
>>> bubble_nebula = cp.load('Test Files/BubbleNebula/Bubble.csdfe')
>>> print(bubble_nebula.data_structure)
{
  "csdm": {
    "version": "1.0",
    "read_only": true,
    "timestamp": "2016-02-26T16:41:00Z",
    "description": "The dataset is a new observation of the Bubble Nebula acquired by The Hubble Heritage Team, in February 2016.",
    "dimensions": [
      {
        "type": "linear",
        "count": 1024,
        "increment": "-0.0002581136196 °",
        "coordinates_offset": "350.311874957 °",
        "quantity_name": "plane angle",
        "label": "Right Ascension"
      },
      {
        "type": "linear",
        "count": 1024,
        "increment": "0.0001219957797701109 °",
        "coordinates_offset": "61.12851494969163 °",
        "quantity_name": "plane angle",
        "label": "Declination"
      }
    ],
    "dependent_variables": [
      {
        "type": "internal",
        "name": "Bubble Nebula, 656nm",
        "numeric_type": "float32",
        "quantity_type": "scalar",
        "components": [
          [
            "0.0, 0.0, ..., 0.0, 0.0"
          ]
        ]
      }
    ]
  }
}

Here, the variable bubble_nebula is an instance of the CSDM class. From the data structure, one finds two dimensions, labeled as Right Ascension and Declination, and one single-component dependent variable named Bubble Nebula, 656nm.

Let’s get the tuples of the dimension and dependent variable instances from the bubble_nebula instance following,

>>> x = bubble_nebula.dimensions
>>> y = bubble_nebula.dependent_variables

There are two dimension instances in x. Let’s look at the coordinates along each dimension, using the coordinates attribute of the respective instances.

>>> print(x[0].coordinates[:10])
[350.31187496 350.31161684 350.31135873 350.31110062 350.3108425
 350.31058439 350.31032628 350.31006816 350.30981005 350.30955193] deg

>>> print(x[1].coordinates[:10])
[61.12851495 61.12863695 61.12875894 61.12888094 61.12900293 61.12912493
 61.12924692 61.12936892 61.12949092 61.12961291] deg

Here, we only print the first ten coordinates along the respective dimensions.

The component of the dependent variable is accessed through the components attribute.

>>> y00 = y[0].components[0]

Visualizing the dataset

Now, to plot the dataset.

Tip

Intensity plot.

>>> import matplotlib.pyplot as plt
>>> from matplotlib.colors import LogNorm
>>> import numpy as np

>>> def plot():
...     # Figure setup.
...     fig, ax = plt.subplots(1,1, figsize=(4,3))
...     ax.set_facecolor('w')
...
...     # the coordinates along the two dimensions
...     x0 = x[0].coordinates
...     x1 = x[1].coordinates
...
...     # Set the extents of the image.
...     extent=[x0[0].value, x0[-1].value, x1[0].value, x1[-1].value]
...
...     # Log intensity image plot.
...     im = ax.imshow(np.abs(y00), origin='lower', cmap='bone_r',
...                    norm=LogNorm(vmax=y00.max()/10, vmin=7.5e-3, clip=True),
...                    extent=extent, aspect='auto')
...
...     # Set the axes labels and the figure tile.
...     ax.set_xlabel(x[0].axis_label)
...     ax.set_ylabel(x[1].axis_label)
...     ax.set_title(y[0].name)
...     ax.locator_params(nbins=5)
...
...     # Add a colorbar.
...     cbar = fig.colorbar(im)
...     cbar.ax.set_ylabel(y[0].axis_label[0])
...
...     # Set the x and y limits.
...     ax.set_xlim([350.25, 350.1])
...     ax.set_ylim([61.15, 61.22])
...
...     # Add grid lines.
...     ax.grid(color='gray', linestyle='--', linewidth=0.5)
...
...     plt.tight_layout(pad=0, w_pad=0, h_pad=0)
...     plt.show()
>>> plot()
../../_images/Bubble.csdfe.png

Nuclear Magnetic Resonance (NMR) dataset

The following example is a \(^{29}\mathrm{Si}\) NMR time-domain saturation recovery measurement of a highly siliceous zeolite ZSM-12. Usually, the spin recovery measurements are acquired over a rectilinear grid where measurements along one of the dimensions are non-uniform and span several orders of magnitude. In this example, we illustrate the use of monotonic dimensions for describing such datasets.

Let’s load the file.

>>> import csdmpy as cp

>>> filename = 'Test Files/NMR/satrec/satRec.csdf'
>>> NMR_2D_data = cp.load(filename)

The tuples of the dimension and dependent variable instances from the NMR_2D_data instance are

>>> x = NMR_2D_data.dimensions
>>> y = NMR_2D_data.dependent_variables

respectively. There are two dimension instances in this example with respective dimension data structures as

>>> print(x[0].data_structure)
{
  "type": "linear",
  "description": "A full echo echo acquisition along the t2 dimension using a Hahn echo.",
  "count": 1024,
  "increment": "80.0 µs",
  "coordinates_offset": "-41.04 ms",
  "quantity_name": "time",
  "label": "t2",
  "reciprocal": {
    "coordinates_offset": "-8766.0626 Hz",
    "origin_offset": "79578822.26200001 Hz",
    "quantity_name": "frequency",
    "label": "29Si frequency shift"
  }
}

and

>>> print(x[1].data_structure)
{
  "type": "monotonic",
  "coordinates": [
    "1 s",
    "5 s",
    "10 s",
    "20 s",
    "40 s",
    "80 s"
  ],
  "quantity_name": "time",
  "label": "t1",
  "reciprocal": {
    "quantity_name": "frequency"
  }
}

respectively. The first dimension is uniformly spaced, as indicated by the linear subtype, while the second dimension is non-linear and monotonically sampled. The coordinates along the respective dimensions are

>>> x0 = x[0].coordinates
>>> print(x0)
[-41040. -40960. -40880. ...  40640.  40720.  40800.] us

>>> x1 = x[1].coordinates
>>> print(x1)
[ 1.  5. 10. 20. 40. 80.] s

Notice, the unit of x0 is in microseconds. It might be convenient to convert the unit to milliseconds. To do so, use the to() method of the respective Dimension instance as follows,

>>> x[0].to('ms')
>>> x0 = x[0].coordinates
>>> print(x0)
[-41.04 -40.96 -40.88 ...  40.64  40.72  40.8 ] ms

As before, the components of the dependent variable are accessed using the components attribute.

>>> y00 = y[0].components[0]
>>> print(y00)
[[  182.26953   +136.4989j    -530.45996   +145.59097j
   -648.56055   +296.6433j   ... -1034.6655    +123.473114j
    137.29883   +144.3381j    -151.75049    -18.316727j]
 [  -80.799805  +138.63733j   -330.4419    -131.69786j
   -356.23877   +463.6406j   ...   854.9712    +373.60577j
    432.64648   +525.6024j     -35.51758   -141.60239j ]
 [ -215.80469   +163.03308j   -330.6836    -308.8578j
  -1313.7393   -1557.9144j   ...  -979.9209    +271.06757j
   -667.6211     +61.262817j   150.32227    -41.081024j]
 [    6.2421875 -163.0319j    -654.5654    +372.27518j
  -1209.3877    -217.7103j   ...   202.91211   +910.0657j
   -163.88281   +343.41882j     27.354492   +21.467224j]
 [  -86.03516   -129.40945j   -461.1875     -74.49284j
     68.13672   -641.11975j  ...   803.3242    -423.6355j
   -267.3672    -226.39514j     77.77344    +80.2041j  ]
 [ -436.0664    -131.52814j    216.32812   +441.56696j
   -577.0254    -658.17645j  ... -1780.457     +454.20862j
  -1765.7441    -375.72888j    407.0703    +162.24716j ]]

Visualizing the dataset

Tip

Intensity plot with cross-sections

More often than not, the code required to plot the data become exhaustive. Here is one such example.

>>> import matplotlib.pyplot as plt
>>> from matplotlib.image import NonUniformImage
>>> import numpy as np

>>> def plot_nmr_2d():
...     """
...     Set the extents of the image.
...     To set the independent variable coordinates at the center of each image
...     pixel, subtract and add half the sampling interval from the first
...     and the last coordinate, respectively, of the linearly sampled
...     dimension, i.e., x0.
...     """
...     si=x[0].increment
...     extent = ((x0[0]-0.5*si).to('ms').value,
...               (x0[-1]+0.5*si).to('ms').value,
...               x1[0].value,
...               x1[-1].value)
...
...     """
...     Create a 2x2 subplot grid. The subplot at the lower-left corner is for
...     the image intensity plot. The subplots at the top-left and bottom-right
...     are for the data slice at the horizontal and vertical cross-section,
...     respectively. The subplot at the top-right corner is empty.
...     """
...     fig, axi = plt.subplots(2,2, figsize=(4,3),
...                             gridspec_kw = {'width_ratios':[4,1],
...                                            'height_ratios':[1,4]})
...
...     """
...     The image subplot quadrant.
...     Add an image over a rectilinear grid. Here, only the real part of the
...     data values is used.
...     """
...     ax = axi[1,0]
...     im = NonUniformImage(ax, interpolation='nearest',
...                          extent=extent, cmap='bone_r')
...     im.set_data(x0, x1, y00.real/y00.real.max())
...
...     """Add the colorbar and the component label."""
...     cbar = fig.colorbar(im)
...     cbar.ax.set_ylabel(y[0].axis_label[0])
...
...     """Set up the grid lines."""
...     ax.images.append(im)
...     for i in range(x1.size):
...         ax.plot(x0, np.ones(x0.size)*x1[i], 'k--', linewidth=0.5)
...     ax.grid(axis='x', color='k', linestyle='--', linewidth=0.5, which='both')
...
...     """Setup the axes, add the axes labels, and the figure title."""
...     ax.set_xlim([extent[0], extent[1]])
...     ax.set_ylim([extent[2], extent[3]])
...     ax.set_xlabel(x[0].axis_label)
...     ax.set_ylabel(x[1].axis_label)
...     ax.set_title(y[0].name)
...
...     """Add the horizontal data slice to the top-left subplot."""
...     ax0 = axi[0,0]
...     top = y00[-1].real
...     ax0.plot(x0, top, 'k', linewidth=0.5)
...     ax0.set_xlim([extent[0], extent[1]])
...     ax0.set_ylim([top.min(), top.max()])
...     ax0.axis('off')
...
...     """Add the vertical data slice to the bottom-right subplot."""
...     ax1 = axi[1,1]
...     right = y00[:,513].real
...     ax1.plot(right, x1, 'k', linewidth=0.5)
...     ax1.set_ylim([extent[2], extent[3]])
...     ax1.set_xlim([right.min(),  right.max()])
...     ax1.axis('off')
...
...     """Turn off the axis system for the top-right subplot."""
...     axi[0,1].axis('off')
...
...     plt.tight_layout(pad=0., w_pad=0., h_pad=0.)
...     plt.subplots_adjust(wspace=0.025, hspace=0.05)
...     plt.show()
>>> plot_nmr_2d()
../../_images/satRec.csdf.png

Transmission Electron Microscopy (TEM) dataset

The following TEM dataset is a section of an early larval brain of Drosophila melanogaster used in the analysis of neuronal microcircuitry. The dataset was obtained from the TrakEM2 tutorial and subsequently converted to the CSD model file-format.

Let’s import the CSD model data-file and look at its data structure.

>>> import csdmpy as cp
>>> import matplotlib.pyplot as plt

>>> filename = 'Test Files/TEM/TEM.csdf'
>>> TEM = cp.load(filename)
>>> print(TEM.data_structure)
{
  "csdm": {
    "version": "1.0",
    "read_only": true,
    "timestamp": "2016-03-12T16:41:00Z",
    "description": "TEM image of the early larval brain of Drosophila melanogaster used in the analysis of neuronal microcircuitry.",
    "dimensions": [
      {
        "type": "linear",
        "count": 512,
        "increment": "4.0 nm",
        "quantity_name": "length",
        "reciprocal": {
          "quantity_name": "wavenumber"
        }
      },
      {
        "type": "linear",
        "count": 512,
        "increment": "4.0 nm",
        "quantity_name": "length",
        "reciprocal": {
          "quantity_name": "wavenumber"
        }
      }
    ],
    "dependent_variables": [
      {
        "type": "internal",
        "numeric_type": "uint8",
        "quantity_type": "scalar",
        "components": [
          [
            "126, 107, ..., 164, 171"
          ]
        ]
      }
    ]
  }
}

This dataset consists of two linear dimensions and one single-component dependent variable. The tuples of the dimension and the dependent variable instances from this example are

>>> x = TEM.dimensions
>>> y = TEM.dependent_variables

and the respective coordinates (viewed only for the first ten coordinates),

>>> print(x[0].coordinates[:10])
[ 0.  4.  8. 12. 16. 20. 24. 28. 32. 36.] nm

>>> print(x[1].coordinates[:10])
[ 0.  4.  8. 12. 16. 20. 24. 28. 32. 36.] nm

For convenience, let’s convert the coordinate from nm to µm using the to() method of the respective Dimension instance,

>>> x[0].to('µm')
>>> x[1].to('µm')

and plot the data.

>>> def plot_image():
...     plt.figure(figsize=(4,3))
...
...     # Set the extents of the image plot.
...     extent = [x[0].coordinates[0].value, x[0].coordinates[-1].value,
...               x[1].coordinates[0].value, x[1].coordinates[-1].value]
...
...     # Add the image plot.
...     im = plt.imshow(y[0].components[0], origin='lower', extent=extent, cmap='gray')
...
...     # Add a colorbar.
...     cbar = plt.gca().figure.colorbar(im)
...     cbar.ax.set_ylabel(y[0].axis_label[0])
...
...     # Set up the axes label and figure title.
...     plt.xlabel(x[0].axis_label)
...     plt.ylabel(x[1].axis_label)
...     plt.title(y[0].name)
...
...     # Set up the grid lines.
...     plt.grid(color='k', linestyle='--', linewidth=0.5)
...
...     plt.tight_layout(pad=0, w_pad=0, h_pad=0)
...     plt.show()
>>> plot_image()
../../_images/TEM.csdf.png